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IN THE CLAIMS: 

The text of all pending claims, (including withdrawn claims) is set forth below. Cancelled 
and not entered claims are indicated with claim number and status only. The claims as listed 
below show added text with underlining and deleted text with str i koth rough . The status of each 
claim is indicated with one of (original), (currently amended), (cancelled), (withdrawn), (new), 
(previously presented), or (not entered). 

1 . (canceled) 

2. (currently amended) The program method according to claim 4 6, wherein 
the LU decomposition is executed in parallel by each processor of each node in a 

recursive procedure. 

3. (currently amended) The program method according to claim 4 6, wherein 

in said update step, while computing a row block, each node transfers data that belongs 
to a computed block and is needed to update other blocks, to other nodes in parallel to the 
computation. 

4. (currently amended) The program method according to claim 4 6, wherein 

said parallel computer is a SMP node distributed-memory type parallel computer in which 
each node is a SMP (symmetric multi-processor). 

5. (currently amended) A matrix processing device of a parallel computer in which a 
plurality of processors and a plurality of nodes including memory are connected through a 
network, comprising: 

a first allocation unit .d i str i but i ng and a ll ocat i ng ono comb i nat i on of bund l es of row b l ocks 
of a matr i x, cyc li ca ll y a ll ocated, to oach nodo i n ordor to procoss tho comb i nat i on of tho bund l es 
dividing an array A(1 :k,1 :k) of a matrix to be processed by the number n of nodes and assigning 

divided matrices subarravs A(1 :k/n,1:k) A(k(n-1)/n:k, 1 :k), dividing one of the subarravs into 

narrow block by integer m, and each node's reading data from memory so that a first narrow 

block is placed in a first node, a second narrow block is placed in a second node a m-th 

narrow block is placed in a (mod(m-1 ,n)+1 )-th node ; 

a separation unit separat i ng a combinat i on of bund l es of b l ocks i nto a d i agona l b l ock, a 
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co l umn b l ock und e r th e d i agona l b l ock and oth e r b l ocks eliminating data corresponding to 
diagonal blocks A(nbase:nbase+m, nbase:nbase+m), where nbase and m are intergers, from 
data of the narrow blocks placed in each node, at each node ; 

a second allocation unit redundantly allocating same data as the diagonal block which is 
eliminated at each node to each node commonly and a l so a ll ocat i ng on e of b l ocks obta i n e d by 
on e- d i m e ns i ona ll y d i v i d i ng th e co l umn b l ock, to each of th e p l ura li ty of nod e s wh ile 
commun i cat i ng i n para llel; 

an LU decomposition unit applying LU decomposition to both the diagonal block and the 
allocated block in parallel in each node wh ile commun i cat i ng among nod e s ; and 

an update unit updating the ©the* blocks of the matrix which is not vet LU-decomposed , 
using an LU-decomposed block , at each node . 

6. (currently amended) A matrix processing method of a parallel computer in which 
a plurality of processors and a plurality of nodes including memory are connected through a 
network, comprising: 

d i str i but i ng and a ll ocat i ng on e comb i nat i on of bund l es of row b l ocks of a matr i x, cyc li ca ll y 
a ll ocat e d, to e ach nod e i n ord e r to process th e comb i nat i on of th e bund le s dividing an array 
Ad :k,1 :k) of a matrix to be processed by the number n of nodes and assigning divided matrices 

subarrays A(1 :k/n,1 :k) A(k(n-1 )/n:k, 1 :k), dividing one of the subarravs into narrow block by 

integer m, and each node's reading data from memory so that a first narrow block is placed in a 

first node, a second narrow block is placed in a second node a m-th narrow block is placed 

in a (mod(m-1 ,n)+1 )-th node ; 

s e parat i ng a comb i nat i on of bund le s of blocks i nto a d i agona l b l ock, a co l umn b l ock 
und e r th e d i agona l b l ock and oth e r b l ocks eliminating data corresponding to diagonal blocks 
A(nbase:nbase+m, nbase:nbase+m), where nbase and m are intergers, from data of the narrow 
blocks placed in each node, at each node ; 

redundantly allocating same data as the diagonal block which is eliminated at each node 
to each node commonly and a l so a ll ocat i ng one of b l ocks obta i n e d by on e- d i m e ns i ona ll y 
d i v i d i ng th e co l umn b l ock, to e ach of th e plura li ty of nod e s wh ile commun i cat i ng i n para llel; 

applying LU decomposition to both the diagonal block and the allocated block in parallel 
in each node wh ile commun i cat i ng among nod e s ; and 

updating the etbef blocks of the matrix which is not vet LU-decomposed , using an LU- 
decomposed block , at each node. 
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7. (currently amended) A computer-readable storage medium on which is recorded 
a program for enabling a computer to realize a matrix processing method oi as a parallel 
computer in which a plurality of processors and a plurality of nodes including memory are 
connected through a network, the method comprising: 

d i str i but i ng and a ll ocat i ng on e combinat i on of bund le s of row b l ocks of a matr i x, cyc li ca ll y 
a ll ocat e d, to e ach nod e i n ord e r to proc e ss the comb i nat i on of th e bund le s dividing an array 
A(1 :k,1 :k) of a matrix to be processed by the number n of nodes and assigning divided matrices 

subarrays A(1 :k/n,1 :k) A(k(n-1 )/n:k, 1 :k), dividing one of the subarrays into narrow block by 

integer m, and each node's reading data from memory so that a first narrow block is placed in a 

first node, a second narrow block is placed in a second node a m-th narrow block is placed 

in a (mod(m-1 ,n)+1 )-th node ; 

s e parat i ng a comb i nat i on of bund le s of blocks i nto a d i agona l b l ock, a co l umn b l ock 
und e r th e d i agona l b l ock and oth e r b l ocks eliminating data corresponding to diagonal blocks 
A(nbase:nbase+m, nbase:nbase+m), where nbase and m are intergers, from data of the narrow 
blocks placed in each node, at each node ; 

redundantly allocating same data as the diagonal block which is eliminated at each node 
to each node commonly and a l so a l locat i ng on e of b l ocks obta i n e d by on e- d i m e ns i ona ll y 
d i v i d i ng th e co l umn b l ock, to e ach of th e p l ura li ty of nod e s wh il e commun i cat i ng i n para llel; 

applying LU decomposition to both the diagonal block and the allocated block in parallel 
in each node wh ile commun i cat i ng among nod e s ; and 

updating the ether blocks of the matrix which is not vet LU-decomposed , using the LU- 
decomposed block , at each node . 
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